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Abstract 


As a critical adaptive mechanism, amino acid replacements on the severe acute respiratory syndrome coronavirus (SARS-CoV) spike protein 
could alter the receptor-binding specificity of this envelope glycoprotein and in turn lead to the emergence or reemergence of this viral zoonosis. 
Based on the X-ray structures of SARS-CoV spike receptor-binding domain (RBD) in complex with its functional receptor (angiotensin-converting 
enzyme 2, ACE2), we perform computational simulations of interactions between three representative RBD mutants and four host species-specific 
receptors. The comparisons between computational predictions and experimental evidences validate our structural bioinformatics approaches. And 
the predictions further indicate that some viral prototypes might utilize the rat ACE2 while rats might serve as a vector or reservoir of SARS-CoV. 
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1. Introduction 


Will the severe acute respiratory syndrome (SARS) reemerge 
in the coming years? This is a huge challenge of great concerns 
facing the whole scientific community especially virologists. 
As an envelope glycoprotein on the virion surface, the spike 
protein of SARS coronavirus (SARS-CoV) plays a crucial 
role in the virus entry (Hofmann and Pohlmann, 2004). This 
structural protein contains two functional regions: the exter- 
nal S1 region responsible for the initial attachment to cellular 
receptor and the internal $2 region contributing to the subse- 
quent fusion between viral envelope and cellular membrane. 
A membrane-associated zinc metallopeptidase, angiotensin- 
converting enzyme 2 (ACE2), was identified as the functional 
receptor for SARS-CoV (Li et al., 2003). And a 193-amino 
acid (residues 318-510) independently folded receptor-binding 
domain (RBD), harbored within the S1 region, could efficiently 
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bind human ACE2 (Wong et al., 2004). Moreover, the observa- 
tion that the murine ACE2, compared to its human homologue, 
highly limited the efficient replication of SARS-CoV human 
isolate TOR2 in mouse cells strongly suggested the presence of 
species-specific viral entry barriers and the absence of post-entry 
barriers. Furthermore, two naturally occurred amino acid substi- 
tutions (K479N and S487T) were experimentally confirmed to 
be the critical determinants for the transmission of this zoono- 
sis from civet cat to human (Li et al., 2005b). The recently 
solved crystal structures of the spike RBD in complex with its 
receptor, human ACE2, also demonstrated the significance of 
those two residue replacements in the adaptation of viral spike 
to human receptor (F. Li et al., 2005; Li et al., 2006). Clearly, 
the ACE2—RBD complex structure would undoubtedly facili- 
tate the researches on the receptor recognition of SARS-CoV 
and the adaptive mechanism of virus crossing species barriers. 
And the recently developed structural bioinformatics program 
(Liu and Kuhlman, 2006) offer us a powerful tool to elucidate 
structural and energetic factors involved in SARS emergence 
or reemergence. In this study, we conducted structure-based 
computational simulations of interactions between three repre- 
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sentative SARS-CoV RBD mutants (TOR2, GD and SZ3) and 
four host species-specific receptors (human, civet cat, mouse 
and rat) for assessment of their performance to predict viral 
crossing-barrier events. 


2. Materials and methods 


We retrieved the coordinates data of SARS-CoV spike RBD 
complexed with human ACE2 peptidase domain (F. Li et al., 
2005) (PDB code: 2AJF) from the protein data bank (PDB) 
(Berman et al., 2000). There are two complexes in the asymmet- 
ric unit of crystal (F. Li et al., 2005). The structural differences 
between the two complexes are mainly due to the conforma- 
tional changes of cellular receptors rather than those of viral 
RBDs. As a membrane-associated vasopeptidase, ACE2 had a 
shell-like catalytic extracellular domain (Towler et al., 2004), 
and its active site is deeply buried in a long cleft whose two 
sides are just the two lobes of the shell. After the engagement of 
substrate or inhibitor, the two lobes will exhibit a large hinge- 
bending movement toward each other to put relevant residues 
into right positions for catalysis. In the coordinates file, one 
ACE2-RBD complex (the complex AE with the chain A for 
ACE2 and the chain E for RBD) contains a fully open receptor 
while the other (the complex BF with the chain B for ACE2 and 
the chain F for RBD) has a slightly closed counterpart. Thus, 
as revealed in previous studies (F. Li et al., 2005; Towler et al., 
2004), the former enzyme was ina free form whereas the latter in 
an inhibitor-bound state. However, both biochemical and struc- 
tural researches (F. Li et al., 2005; Li et al., 2005b; Wong et al., 
2004) indicated that the human receptor-binding activity of RBD 
is independent of this open-to-closed conformational transition. 
Hence, the two complex structures are respectively subject to 
structural bioinformatics analyses for further comparison. The 
in silicon analyses included three steps. Firstly, the contact map 
analysis (CMA) (Sobolev et al., 2005) was employed to define 
the contacting residues across the complex interfaces. Here, 
we focused on the two RBD residues (ASN479 and THR487) 
because their importance in the adaptation of SARS-CoV spike 
protein to human ACEZ2 has been biochemically and structurally 
clarified (F. Li et al., 2005; Li et al., 2005b). Additionally, those 
spike or ACE2 residues forming side chain contacts with the 
two focused positions are also under study due to their potential 
roles involved in the geometric and chemical complementarity 
of binding surface. After the definition of contacting residues, 
the RosettaDesign web server (Liu and Kuhlman, 2006) is con- 
sulted to redesign the interactions between RBD and receptor. 
In this step, the RBD positions 479 and 487 plus their con- 
tacting sites on the two binding partners are virtually mutated 
to mimic the contacts between the RBDs of three representa- 
tive viral isolates and the ACE2 of four host species (F. Li 
et al., 2005; Li et al., 2005b). The three viral strains includes 
TOR2 (GenBank Accession No.: AY274119), GD (GenBank 
Accession No.: AY525636) and SZ3 (GenBank Accession No.: 
AY304486) which are isolated from the human patient during 
the serious pandemic in late 2003 (Marra et al., 2003; Rota et al., 
2003), from the human patient of the mild infection case in the 
reemergence of 2004 (Consortium, 2004), and from the civet cat 


Table 1 
Redesigned residues of three representative viral isolates spike receptor-binding 
domain 


479 487 

TOR2 N T 
GD N S 
SZ3 K S 
Table 2 
Redesigned residues of four host species ACE2 receptor 

31 34 353 
Civet cat T Y K 
Human K H K 
Mouse N Q H 
Rat K Q H 


in the exotic market in early 2003 (Guan et al., 2003), respec- 
tively. Four host species comprised human (GenBank Accession 
No.: QOBYF1), civet cat (GenBank Accession No.: Q56NL1), 
mouse (GenBank Accession No.: Q8ROIO) and rat (GenBank 
Accession No.: Q5EGZ1). Both the viral isolates and the host 
species were selected corresponding to previous virological 
studies (F. Li et al., 2005; Li et al., 2004, 2005b). As shown in 
Tables 1 and 2, the RBD positions 479 and 487 plus their con- 
tacting ACE2 sites of species-specific residue are redesigned 
according to the sequence content variations of viral isolates 
and host receptors identified as crucial determinants for recep- 
tor recognition (F. Li et al., 2005; Li et al., 2006, 2005b). Those 
residues, within RBD or ACE2, forming side chain contacts with 
the redesigned residues, are also repacked to accommodate the 
structural variations. Except the two categories of residues to 
be redesigned or repacked, all the others remained untouched in 
the complex structure. These computational simulations could 
serve as a reliable indicator for the validation of our struc- 
tural bioinformatics approaches in this case, depending on its 
reproduction of receptor-binding pattern generated in the exper- 
imental investigations. Finally, the redesigned ACE2—RBD 
complex structure models are submitted to the FOLDX web 
server (Schymkowitz et al., 2005), an online empirical force 
field, to assess the effect of mutations on the binding free energy 
of complex and to measure the differences in the association 
energy between the wild type and the mutants. In princi- 
ple, the lower the binding free energy, the higher the binding 
affinity. 


3. Results and discussion 


Structural bioinformatics analyses of the ACE2—RBD com- 
plexes were performed using a three-step procedure as described 
in Section 2. The calculated binding free energies of the wild type 
and virtual mutant complex structures are shown in Table 3. 
Here, the agreement or disagreement between the computa- 
tional predictions and the experimental evidences is paid special 
attention for the validation of our structural bioinformatics 
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Table 3 
Calculated binding free energies of the wild type and virtual mutant ACE2-RBD 
structures based on the sister complexes AE and BF 


GD SZ3 TOR2 

Complex AE 
Civet cat —13.2 —11.8 —14.5 
Human -14.1 —10.9 —15.8 
Mouse —12.0 —12.3 —12.3 
Rat —12.0 -14.4 —12.3 

Complex BF 
Civet cat —12.2 —14.5 -17.7 
Human —10.7 —12.3 —14.0 
Mouse —12.1 —14.0 —12.4 
Rat -12.1 —14.0 —12.4 


The unit for binding free energy is kcal/mol. The viral isolates include GD, SZ3 
and TOR2. The host species comprise civet cat, human, mouse and rat. The 
energy values of wild type complexes are shown in boldface. 


approaches and the discrimination of predictability between the 
sister complexes AE and BF. For instance, both pseudovirus 
infections and binding assays reveals that all the three RBDs 
mentioned above could attach to the civet cat receptor whereas 
only the TOR2 RBD is able to efficiently utilize the human ACE2 
(Li et al., 2005b). In addition, the TOR2 RBD is shown to lack 
the rodent receptor-binding activity (Li et al., 2004, 2005b). As 
for the models derived from the AE complex, the civet cat ACE2 
attaches to the RBDs of TOR2, GD and SZ3 with the binding 
free energies in an ascending order (the binding affinities in 
a descending order), being congruent with the observations in 
binding assays (Li et al., 2005b). And a similar rank of RBD 
association energies is also found for the human receptor, com- 
pletely consistent with the data of pseudovirus infections and 
binding assays (Li et al., 2005b). In particular, the wild type com- 
plex AE (TOR2-human), representing the interactions between 
human receptor and the human-isolated TOR2 RBD obtained 
during the serious emergence of SARS in 2003, holds the low- 
est energy (— 15.8 kcal/mol) among the 12 AE-based complexes. 
While the highest binding energy score (—10.9 kcal/mol) is 
rewarded to the mutant complex (SZ3—human) mimicking the 
contacts of civet cat-isolated SZ3 RBD with human ACE2. In the 
middle of the two extremes lies in the energy value of the mutant 
complex (GD-human) modeling associations between human 
ACE2 and the human-isolated GD RBD in the reemerging 
case of SARS with mild symptoms. The significant differences 
in the human receptor-binding energy or affinity among the 
three viral RBDs, especially that between the two extremes, 
offer us an excellent interpretation for the human receptor- 
binding specificity differentiation of SARS-CoV spike protein 
given the high correlation with biochemical evidences (Li et al., 
2004, 2005b). In addition, the equally higher binding energies 
(—12.3 kcal/mol) of rodent ACE2 with TOR2 RBD success- 
fully account for the viral entry barriers posed by the rodent 
receptors (Li et al., 2004). Collectively, the virtual substitutions 
strongly suggest the validity of our structural bioinformatics 
protocols and the predictability of AE complex. Notably, this 
directly leads to the following speculation that rats might be a 
vector or reservoir for some prototypes of SARS-CoV due to 


the calculated binding free energy of SZ3-rat complex being 
comparable to those of TOR2-civet cat and GD—human com- 
plexes. In other words, a probable cross-species transmission 
chain of SARS-CoV is from previously identified bats (Li et al., 
2005a) through rats and subsequently civet cats (Consortium, 
2004; Guan et al., 2003) to human, coupled with molecular adap- 
tation to host species-specific factors via high-rate mutations. In 
fact, the possible vector role of rats in the SARS outbreak at 
Amoy Gardens has been previously implied (Ng, 2003) and the 
confirmed first case of human SARS-CoV infection was reported 
to have only contacts with rats (Liang et al., 2004). In particular, 
avery recent review (Wang et al., 2006) suggested serologic evi- 
dences of rats susceptible to SARS-CoV whereas the detailed 
information still waits for publication. The viral strain they used 
might account for the sharp conflict between in vivo and in vitro 
experimental observations. Certainly, the molecular epidemio- 
logical surveillance and subsequent biochemical investigations 
are clearly necessary to test this hypothesis. As proposed pre- 
viously (Holmes and Rambaut, 2004), the surveillance should 
be focused on the high population density wildlife spatially and 
temporally in close contacts with human community, such as 
birds, bats and rodents. In addition, the sampling and sequenc- 
ing of those animal ACE2 in combination with the computational 
approaches adopted here could be informative for the rapid iden- 
tification of viral mutants of high likelihood for the utilization 
of both animal and human receptors, the co-infection of tar- 
get cells of different host species, and the expansion of host 
range, or even worse the reemergence of this zoonosis in human 
population. 

In sharp contrast to the predictions based on the complex 
AE, the calculations derived from the complex BF display an 
unexpected pattern in conflict with biochemical evidences. For 
instance, the binding energy of the TOR2-civet cat complex is 
the minimum among those of all the 12 BF-based structures and 
even markedly lowers than that of the wild type (TOR2—human) 
with a difference of more than 3 kcal/mol. However, pseudovirus 
infections and binding assays (Li et al., 2005b) do not support 
this indication but find that the TOR2-civet cat complex has an 
equivalent binding affinity with reference to the TOR2—human 
complex. Besides, disagreements between computations and 
experiments are simultaneously found for interactions of human 
or civet cat receptor with the other two viral RBDs (SZ3 and 
GD). For each of the two receptors, the SZ3 RBD binds to 
ACE2 with a lower free energy than the GD RBD, opposite 
to the rank speculated from biochemical analyses (Li et al., 
2005b). Obviously, the energetic effects of viral strain-specific 
or host species-specific variations on the RBD-—ACE2 complex 
formation under the condition of inhibitor-bound receptor still 
need further experimental characterization. But in this case, 
the poor predictability of BF complex prohibits it from further 
consideration. 
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